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Claim Amendments 

1 . (Currently amended) A method comprising: 

creating a suffix tree to determine the frequency of phrases within a text 

corpus; 

specifying a set of frequently occurring phrases; and 

filtering the set of frequently occurring phrases to determine a set of 

frequently occurring and unrecognized phrases as e ntity name and jargon term 

candidates. 

2. (Original) The method of claim 1 further comprising: 

sorting each phrase of the set of frequently occurring phrases in inverse 
lexicographical order prior to filtering the set of frequently occurring phrases. 

3. (Original) The method of claim 1 wherein the text corpus is 
preprocessed. 

4. (Original) The method of claim 3 wherein the text corpus is text of a 
human language. 
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5. (Original) The method of claim 4 wherein the human language is 
Chinese. 

6. (Original) The method of claim 4 wherein filtering the set of frequently 
occurring phrases includes comparing a component word of a phrase to a dictionary 
of common words and excluding the phrase from the set of entity name and jargon 
term candidates if the component word is a common word. 

7. (Original) The method of claim 4 further comprising: 

reducing the set of entity name and jargon term candidates by applying 
natural language processing rules. 

8. (Currently amended) The method of claim 4 7 wherein the natural 
language processing rules are rules selected from the list consisting of 
morphological rules, semantic rules, and syntactic rules. 

9. (Currently amended) A machine-readable medium containing 
instructions which, when executed by a processor, cause the processor to perform a 
method, the method comprising: 

creating a suffix tree to determine the frequency of phrases within a text 

corpus; 

specifying a set of frequently occurring phrases; and 

3 



Intel Corporation 

Docket: P11917 



App. No. 10/017,408 



filtering the set of frequently occurring phrases to determine a set of 
frequently occurring and unrecognized phrases as entity name and jargon term 
candidates. 

10. (Original) The machine-readable medium of claim 9 wherein the method 
further comprises: 

sorting each phrase of the set of frequently occurring phrases in inverse 
lexicographical order prior to filtering the set of frequently occurring phrases. 

1 1 . (Original) The machine-readable medium of claim 9 wherein the text 
corpus is preprocessed. 

12. (Original) The machine-readable medium of claim 1 1 wherein the text 
corpus is text of a human language. 

13. (Original) The machine-readable medium of claim 12 wherein the human 
language is Chinese. 

14. (Original) The machine-readable medium of claim 12 wherein filtering the 
set of frequently occurring phrases includes comparing a component word of a 
phrase to a dictionary of common words and excluding the phrase from the set of 
entity name and jargon term candidates if the component word is a common word. 
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15. (Original) The machine-readable medium of claim 12 wherein the method further 
comprises: 

reducing the set of entity name and jargon term candidates by applying 
natural language processing rules. 

16. (Currently amended) The machine-readable medium of claim 45 15 
wherein the natural language processing rules are rules selected from the list 
consisting of morphological rules, semantic rules, and syntactic rules. 

1 7. (Currently amended) A system comprising: 

a memory having stored therein executable instructions which when 
executed by a processor, cause the processor to perform operations comprising: 
creating a suffix tree data structure, the suffix tree data structure 

storing 

phrase frequency data for a text corpus; 

using the phrase frequency data to specify a set of frequently occurring 
phrases; and 

filtering the set of frequently occurring phrases to determine a set of 
frequently occurring and unrecognized phrases as entity name and jargon 
term candidates ; and 
a processor to execute the instructions. 
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18. (Original) The system of claim 17 wherein the operations further comprise: 
sorting each phrase of the set of frequently occurring phrases in inverse 
lexicographical order prior to filtering the set of frequently occurring phrases. 

19. (Original) The system of claim 17 wherein the text corpus is 
preprocessed. 

20. (Original) The system of claim 19 wherein the text corpus is text of a 
human language. 

21 . (Original) The system of claim 20 wherein the human language is 
Chinese. 

22. (Original) The system of claim 20 wherein filtering the set of frequently 
occurring phrases includes comparing a component word of a phrase to a dictionary 
of common words and excluding the phrase from the set of entity name and jargon 
term candidates if the component word is a common word. 

23. (Original) The system of claim 20 further comprising: 

reducing the set of entity name and jargon term candidates by applying 
natural language processing rules. 



6 



Intel Corporation 

Docket: P11917 



App. No. 10/017,408 



24. (Currently amended) The system of claim 20 23 wherein the natural 
language processing rules are rules selected from the list consisting of 
morphological rules, semantic rules, and syntactic rules. 

25. (New) The method of claim 1, wherein filtering comprises: 
excluding a phrase from the set of frequently occurring phrases, wherein the 

phrase comprises a sub-phrase that occurs at a higher frequency than the phrase. 

26. (New) The machine-readable medium of claim 9 wherein filtering 
comprises: 

excluding a phrase from the set of frequently occurring phrases, wherein the 
phrase comprises a sub-phrase that occurs at a higher frequency than the phrase. 

27. (New) The system of claim 17 wherein filtering comprises: 
excluding a phrase from the set of frequently occurring phrases, wherein the 

phrase comprises a sub-phrase that occurs at a higher frequency than the phrase. 
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Conclusion 



The foregoing is submitted as a full and complete response to the Notice of 
Non-compliant Amendment. Applicant submits that the application is in condition for 
allowance. Reconsideration is requested, and allowance of the pending claims is 
earnestly solicited. 

Should it be determined that an additional fee is due under 37 CFR §§1.16 or 
1.17, or any excess fee has been received, please charge that fee or credit the 
amount of overcharge to deposit account #02-2666. If the Examiner believes that 
there are any informalities, which can be corrected by an Examiner's amendment, a 
telephone call to the undersigned at (503) 439-8778 is respectfully solicited. 



c/o Blakely, Sokoloff, Taylor & Zafman, LLP 

12400 Wilshire Blvd. 
Seventh Floor 

Los Angeles, CA 90025-1030 
(408) 720-8300 



Respectfully submitted 




Vincent H. Anderson 
Reg. No. 54,962 



I hereby certify that this correspondence is being deposited with the United States 
Postal service as first class mail with sufficient postage in an envelope addressed 
to: Commissioner for Patents, P.O. Box 1450 Alexandria, VA 22313-1450 
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